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Binary Trees 



Definition A binary tree is either empty, or it consists of a 
node called the root together with two binary trees called 
the left subtree and the right subtree of the root. 



There is one empty binary tree, one binary tree with one node, 
and two with two nodes: 




and 




These are different from each other. We never draw any part of a 
binary tree to look like 



o 



o 



The binary trees with three nodes are: 




Binary Trees 

Transp. 2, Sect. 10.1, Introduction to Binary Trees 
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Traversal of Binary Trees 



At a given node there are three tasks to do in some order: Visit 
the node itself (V); traverse its left subtree (L); traverse its right 
subtree (R). There are six ways to arrange these tasks: 



VLR LVR LRV VRL RVL RLV. 



By standard convention, these are reduced to three by consid- 
ering only the ways in which the left subtree is traversed before 
the right. 



These three names are chosen according to the step at which the 
given node is visited. 

■ With preorder traversal we first visit a node, then traverse 
its left subtree, and then traverse its right subtree. 

■ With inorder traversal we first traverse the left subtree, 
then visit the node, and then traverse its right subtree. 

■ With postorder traversal we first traverse the left subtree, 
then traverse the right subtree, and finally visit the node. 



VLR 



LVR 

inorder 



LRV 



preorder 



postorder 



Traversal of Binary Trees 

Transp. 3, Sect. 10.1, Introduction to Binary Trees 
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Expression Trees 



a 



a + h 



b 




log X 



n 





a-{bxc) 



{a < b) or (c < d) 



Expression: a -\-b logx n\ a — (b x c) (a < b) or (c < d) 



Preorder : + a b log x In — a x b c or < a b < c d 
Inorder : a + b log x n I a — b x c a < b or c < d 
Postorder : a b + x log n I a b c x — a b < c d < or 



Expression Trees 

Transp. 4, Sect. 10.1, Introduction to Binary Trees 
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Linked Binary Trees 

Comparison tree: 




Linked Binary Trees 

Transp. 6, Sect. 10.1, Introduction to Binary Trees 
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Linked Binary Tree Specifications 



Binary tree class: 

template < class Entry > 
class BinaryJree { 
public: 

// Add methods here. 
protected: 

// Add auxiliary function prototypes here. 
Binary _node< Entry > *roo\; 

}; 



Binary node class: 

template < class Entry > 
struct Binary _node { 

// data members: 
Entry data; 

Binary _node< Entry > -Heft; 
Binary _node< Entry > >'^right; 
// constructors: 
Binary _node(); 
Binary _node(const Entry &x); 

}; 



Linked Binary Tree Specifications 

Transp. 7, Sect. 10.1, Introduction to Binary Trees 
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Constructor: 



template < class Entry > 

Binary _tree< Entry > : : Binary _tree( ) 

/* Post: An empty binary tree has been created. -/ 

{ 

root = NULL; 

} 



Empty: 

template < class Entry > 

bool Binary_tree<Entry> :: emptyO const 

/" Post: A result of true is returned if tine binary tree is empty Otiierwise, false is 
returned. -/ 

{ 

return root == NULL; 

} 



Binary tree methods 

Transp. 8, Sect. 10.1, Introduction to Binary Trees 
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Inorder traversal: 



template < class Entry > 

void Binary_tree<Entry> :: inorder(void (>'-visit)(Entry &)) 
/" Post: The tree has been been traversed in infix order sequence. 
Uses: r/?e fivncf/on recursiveJnorder -/ 

{ 

recursive_inorder(root, visit); 

} 

Most Binary_tree methods described by recursive processes can 
be implemented by calling an auxiliary recursive function that 
applies to subtrees. 

template < class Entry > 
void Binary_tree<Entry> :: 

recursive _inorder(Binary_node< Entry > >'^sub_root, 

void (>wisit)(Entry &)) 
/" Pre: sub_root is e/f/?er NULL or points to a subtree of the Binary Jree. 
Post: The subtree has been traversed in inorder sequence. 
Uses: The function recursive_inorder recursively *l 

{ 

if (sub_root ! = NULL) { 
recursive jnorder(sub_root->left, visit); 
(>'vvisit)(sub_root->data); 
recursiveJnorder(sub_root->riglit, visit); 

} 

} 



Binary tree methods 

Transp. 9, Sect. 10.1, Introduction to Binary Trees 
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Binary Tree Class Specification 



template < class Entry > 
class BinaryJree { 

public: 

Binary _tree(); 

bool ennpty( ) const; 
void preorder(void visit) (Entry &)); 
void inorder(void (>vvisit)(Entry &)); 
void postorder(void (>vvisit)(Entry &)); 

int size( ) const; 

void clear(); 

int lieiglit() const; 

void insert(const Entry &); 

BinaryJree (const Binary _tree< Entry > &original); 

Bi nary _tree & operator = (const Binary _tree< Entry > &original); 

~ Binary _tree(); 
protected: 

// Add auxiliary function prototypes here. 
Binary _node< Entry > ^^root; 
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Binary Search Trees 



Can we find an implementation for ordered lists in which 
we can search quickly (as with binary search on a contigu- 
ous list) and in which we can make insertions and deletions 
quickly (as with a linked list)? 



Definition A binary search tree is a binary tree that is 
either empty or in which the data entry of every node has a 
key and satisfies the conditions: 

1. The key of the left child of a node (if it exists) is less than 
the key of its parent node. 

2. The key of the right child of a node (if it exists) is greater 
than the key of its parent node. 

3. The left and right subtrees of the root are again binary 
search trees. 



We always require: 
No two entries in a binary search tree may have equal keys. 



■ We can regard binary search trees as a new ADT. 

■ We may regard binary search trees as a specialization of bi- 
nary trees. 

■ We may study binary search trees as a new implementation 
of the ADT ordered list. 



Binary Search Trees 

Transp. 11, Sect. 10.2, Binary Search Trees 
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The Binary Search Tree Class 



■ The binary search tree class will be derived from the binary 
tree class; hence all binary tree methods are inherited. 

template < class Record > 

class Search_tree: public Binary _tree< Record > { 
public: 

Errorxode insert(const Record &new_data); 

Error_code rennove(const Record &old_data); 

Error_code tree_searcli(Record &target) const; 
private: // Add auxiliary function prototypes here. 

}; 

■ The inherited methods include the constructors, the destruc- 
tor, clear, empty, size, height, and the traversals preorder, inorder, 
and postorder. 

■ A binary search tree also admits specialized methods called 
insert, remove, and tree_search. 

■ The class Record has the behavior outlined in Chapter 7: Each 
Record is associated with a Key. The keys can be compared 
with the usual comparison operators. By casting records to 
their corresponding keys, the comparison operators apply to 
records as well as to keys. 



The Binary Search Tree Class 

Transp. 12, Sect. 10.2, Binary Search Trees 
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Tree Search 



Error_code Search_tree<Record> :: 

tree_search (Record &target) const; 

Post: If there is an entry in the tree whose key matches that 
in target, the parameter target is replaced by the corre- 
sponding record from the tree and a code of success is 
returned. Otherwise a code of not_present is returned. 

■ This method will often be called with a parameter target that 
contains only a key value. The method will fill target with the 
complete data belonging to any corresponding Record in the 
tree. 

■ To search for the target, we first compare it with the entry 
at the root of the tree. If their keys match, then we are fin- 
ished. Otherwise, we go to the left subtree or right subtree as 
appropriate and repeat the search in that subtree. 

■ We program this process by calling an auxiliary recursive 
function. 

■ The process terminates when it either finds the target or hits 
an empty subtree. 

■ The auxiliary search function returns a pointer to the node 
that contains the target back to the calling program. Since 
it is private in the class, this pointer manipulation will not 
compromise tree encapsulation. 

Binary _node< Record > Search _tree< Record > : : search_for_node( 
Binary _node< Record >>'- sub_root, const Record &target) const; 

Pre: sub_root is NULL or points to a subtree of a Search_tree 

Post: If the key of target is not in the subtree, a result of NULL 
is returned. Otherwise, a pointer to the subtree node 
containing the target is returned. 



Tree Search 

Transp. 13, Sect. 10.2, Binary Search Trees 
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Recursive auxiliary function: 

template < class Record > 

Binary _node< Record > >'-Search_tree< Record > : : search_for_node( 
Binary _node< Record >■'- sub_root, const Record &target) const 

{ 

if (sub_root == NULL || sub_root->data == target) 

return sub_root; 
else if (sub_root->data < target) 

return searcli_for_node(sub_root->riglit, target); 
else return searcli_for_node(sub_root->left, target); 

} 

Nonrecursive version: 

template < class Record > 

Binary _node< Record > >'v Searcli_tree< Record > : : searcli_for_node( 
Binary _node< Record > >'^sub_root, const Record &target) const 

{ 

while (sub_root != NULL&&sub_root->data != target) 
if (sub_root->data < target) sub_root = sub_root->riglit; 
else sub_root = sub_root->left; 

return sub_root; 

} 



Auxiliary functions, tree search 

Transp. 14, Sect. 10.2, Binary Search Trees 
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Public method for tree search: 



template < class Record > 
Error.code Search_tree<Record> :: 

tree_search (Record &target) const 
/" Post: If there is an entry in tine tree wtiose i<ey matciies tiiat in target, tiie 
parameter target is repiaced by tiie corresponding record from tiie tree 
and a code of success is returned. Otiierwise a code of not_present 
is returned. 
Uses: fivncf/on search_for_node *l 

{ 

Error_code result = success; 

Binary _node< Record > >'^found = search_for_node(root, target); 
if (found == NULL) 

result = not_present; 
else 

target = found ->data; 
return result; 



Tree search functions 

Transp. 15, Sect. 10.2, Binary Search Trees 
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Binary Search Trees with the Same Keys 




Binary Search Trees with the Same Keys 
Transp. 16, Sect. 10.2, Binary Search Trees 
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Analysis of Tree Search 



■ Draw the comparison tree for a binary search (on an ordered 
hst). Binary search on the Hst does exactly the same compar- 
isons as tree_search will do if it is applied to the comparison 
tree. By Section 7.4, binary search performs 0{\ogn) com- 
parisons for a list of length n . This performance is excellent 
in comparison to other methods, since log n grows very slowly 
as n increases. 

■ The same keys may be built into binary search trees of many 
different shapes. 

■ If a binary search tree is nearly completely balanced ("bushy"), 
then tree search on a tree with n vertices will also do O (log n) 
comparisons of keys. 

■ If the tree degenerates into a long chain, then tree search be- 
comes the same as sequential search, doing 0 (n) comparisons 
on n vertices. This is the worst case for tree search. 

■ The number of vertices between the root and the target, in- 
clusive, is the number of comparisons that must be done to 
find the target. The bushier the tree, the smaller the number 
of comparisons that will usually need to be done. 

■ It is often not possible to predict (in advance of building it) 
what shape of binary search tree will occur. 

■ In practice, if the keys are built into a binary search tree 
in random order, then it is extremely unlikely that a binary 
search tree degenerates badly; tree_search usually performs 
almost as well as binary search. 



Analysis of Tree Search 

Transp. 17, Sect. 10.2, Binary Search Trees 
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Insertion into a Binary Search Tree 



Error_code Search_tree<Record> :: 

insert(const Record &new_data); 

Post: If a Record with a key matching that of new_data al- 
ready belongs to the SearchJree a code of duplicate_error 
is returned. Otherwise, the Record new_data is inserted 
into the tree in such a way that the properties of a bi- 
nary search tree are preserved, and a code of success 
is returned. 




(a) Insert e (b) Insert b (c) Insert d 




(f) Insert g (g) Insert C 



Insertion into a Binary Search Tree 
Transp. 18, Sect. 10.2, Binary Search Trees 
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Method for Insertion 



template < class Record > 

Error_code Search_tree<Record> :: insert(const Record &new_data) 
{ 

return search_and_insert(root, new_data); 

} 



template < class Record > 

Error_code Search_tree<Record> :: search_andjnsert( 

Binary _node< Record > * &sub_root, const Record &new_data) 

{ 

if (sub_root == NULL) { 
sub_root = new Binary _node< Record >(new_data); 
return success; 

} 

else if (new_data < sub_root->data) 
return searcli_andJnsert(sub_root->left, new_data); 

else if (new_data > sub_root->data) 
return searcli_andjnsert(sub_root->riglit, new.data); 

else return duplicate_error; 



The method insert can usually insert a new node into a ran- 
dom binary search tree with n nodes in 0(\ogn) steps. It 
is possible, but extremely unlikely, that a random tree may 
degenerate so that insertions require as many as n steps. 
If the keys are inserted in sorted order into an empty tree, 
however, this degenerate case will occur. 



Method for Insertion 

Transp. 19, Sect. 10.2, Binary Search Trees 
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Treesort 



■ When a binary search tree is traversed in inorder, the keys 
will come out in sorted order. 

■ This is the basis for a sorting method, called treesort: Take 
the entries to be sorted, use the method insert to build them 
into a binary search tree, and then use inorder traversal to 
put them out in order. 



Theorem 10. 1 Treesort makes exactly the same comparisons 
of keys as does quicksort when the pivot for each sublist is 
chosen to be the first key in the sublist. 



Corollary 10.2 In the average case, on a randomly ordered 
list of length n , treesort performs 

2^1n^ + 0{n) ^ 1.39nlgn + 0{n) 

comparisons of keys. 



■ First advantage of treesort over quicksort: The nodes need 
not all be available at the start of the process, but are built 
into the tree one by one as they become available. 

■ Second advantage: The search tree remains available for later 
insertions and removals. 

■ Drawback: If the keys are already sorted, then treesort will 
be a disaster — the search tree it builds will reduce to a chain. 
Treesort should never be used if the keys are already sorted, 
or are nearly so. 



Treesort 

Transp. 20, Sect. 10.2, Binary Search Trees 
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Removal from a Binary Search Tree 




Case: deletion of a leaf Case: one subtree empty 




Case: neither subtree empty 



Removal from a Binary Search Tree 
Transp. 21, Sect. 10.2, Binary Search Trees 
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Auxiliary Function to Remove One Node 



template < class Record > 

Error_code Search_tree<Record> :: rennove_root(Binary_node< Record > 

* &sub_root) 

/" Pre: sub_root is either N\Jll or points to a subtree of tine Search_tree. 

Post: /f sub_root is NULL, a code of not_present is returned. Otiierwise, t\ie root of 
tine subtree is removed in sucii a way tiiat tiie properties of a binary searcii tree 
are preserved. Tine parameter sub_root is reset as tine root of tine modified 
subtree, anc/ success is returned. -/ 

{ 

if (sub_root == NULL) return not_present; 
Binary _node< Record > -'vto-delete = sub_root; 

// Remember node to deiete at end. 
if (sub_root->riglit == NULL) sub_root = sub_root->left; 
else if (sub_root->left == NULL) sub.root = sub_root->riglit; 

else { // Neitiner subtree is empty 

to.delete = sub_root->left; // Move ieft to find predecessor 
Binary_node< Record > >'v parent = sub_root; // parenf of to_delete 
while (to_delete->right !=NULL){ // to _de I ete /s nof f/?e prec/ecessor. 

parent = to_delete; 

to_delete = to_delete->right; 

} 

sub_root->data = to .delete ->data; // Move from to_delete to root. 
if (parent == sub_root) sub_root->left = to_delete->left; 
else parent->right = to_delete->left; 

} 

delete to_delete; // f?emoi/eto_delete /rom free, 

return success; 



Auxiliary Function to Remove One Node 
Transp. 22, Sect. 10.2, Binary Search Trees 
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Removal Method 

template < class Record > 

Error_code Search_tree<Record> :: rennove(const Record &target) 

/" Post: If a Record with a key matching that of target belongs to the Search_tree a 

code of success is returned and the corresponding node is removed from the 

tree. Otherwise, a code of not_present is returned. 
Uses: FL/ncf/on search_and_destroy ■'-/ 

{ 

return search_and_destroy(root, target); 

} 

As usual, this method uses an auxihary recursive function that 
refers to the actual nodes in the tree. 

template < class Record > 

Error_code Search_tree<Record> :: search_and_destroy( 

Binary _node< Record >>'- &sub_root, const Record &target) 
/" Pre: sub_root is either NULL or points to a subtree of the Search_tree. 

Post: If the key of target is not in the subtree, a code of not_present is returned. 
Otherwise, a code of success is returned and the subtree node containing 
target has been removed in such a way that the properties of a binary search 
tree have been preserved. 
Uses: Fivncf/ons search_and_destroy recursively and remove joo\ *I 

{ 

if (sub_root == NULL || sub_root->data == target) 

return rennove_root(sub_root); 
else if (target < sub_root->data) 

return search_and_destroy(sub_root->left, target); 
else 

return search_and_destroy(sub_root->right, target); 



Removal Method 

Transp. 23, Sect. 10.2, Binary Search Trees 
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Linked Binary Tree with Sentinel 




Linked Binary Tree with Sentinel 
Transp. 24, Sect. 10.2, Binary Search Trees 
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Information Retrieval Project 



■ Purpose of project: Compare several different kinds of binary 
search trees useful for information retrieval. The current, 
first part of the project is to produce a driver program and the 
information-retrieval package using ordinary binary search 
trees. 

■ Outline of project: 

1. Create the data structure (binary search tree). 

2. Ask the user for the name of a text file and open it to read. 

3. Read the file, split it apart into individual words, and in- 
sert the words into the data structure. With each word 
will be kept a frequency count and, when duplicate words 
are encountered, the frequency count will be increased. 
The same word will not be inserted twice in the tree. 

4. Print the number of comparisons done and the CPU time 
used in part 3. 

5. If the user wishes, print out all the words in the data struc- 
ture, in alphabetical order, with their frequency counts. 

6. Put everything in parts 2-5 into a do . . . while loop that will 
run as many times as the user wishes. Thus the user can 
build the data structure with more than one file if desired. 
By reading the same file twice, the user can compare time 
for retrieval with the time for the original insertion. 



Information Retrieval Project 

Transp. 25, Sect. 10.2, Binary Search Trees 
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Further Specifications, Information Retrieval 



■ The input to the driver will be a file. The program will be 
executed with several different files; the name of the file to be 
used should be requested from the user while the program is 
running. 

■ A word is defined as a sequence of letters, together with apos- 
trophes (') and hyphens (-), provided that the apostrophe or 
hyphen is both immediately preceded and followed by a let- 
ter. Uppercase and lowercase letters should be regarded as 
the same (by translating all letters into either uppercase or 
lowercase, as you prefer). A word is to be truncated to its first 
20 characters (that is, only 20 characters are to be stored in 
the data structure) but words longer than 20 characters may 
appear in the text. Nonalphabetic characters (such as digits, 
blanks, punctuation marks, control characters) may appear 
in the text file. The appearance of any of these terminates a 
word, and the next word begins only when a letter appears. 

■ Be sure to write your driver so that it will not be changed at 
all when you change implementation of data structures later. 



Further Specifications, Information Retrieval 
Transp. 26, Sect. 10.2, Binary Search Trees 
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Function Specifications, Information Retrieval 



void update(const String &word, 

Search_tree< Record > &structure, int &nunn_connps); 

Post: If word was not already present in structure, then word 
has been inserted into structure and its frequency count 
is 1. If word was already present in structure, then its 
frequency count has been increased by 1. The variable 
parameter num.connps is set to the number of compar- 
isons of words done. 



void print(const Search_tree< Record > &structure); 

Post: All words in structure are printed at the terminal in al- 
phabetical order together with their frequency counts. 



void write_nnethod(); 

Post: The function has written a short string identifying the 
abstract data type used for structure. 



Function Specifications, Information Retrieval 
Transp. 27, Sect. 10.2, Binary Search Trees 
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Building a Balanced Binary Search Tree 



Problem: Start with an ordered list and build its entries into a 
binary search tree that is nearly balanced ("bushy"). 

If the nodes of a complete binary tree are labeled in inorder 
sequence, starting with 1 , then each node is exactly as many 
levels above the leaves as the highest power of 2 that divides 
its label. 




n = l n = 2 n = 3 n = 4: n = 5 




n = 21 

To establish future links, we must remember pointers to one node 
on each level, the last node processed on that level. 
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Algorithm Development 

■ As each new node is added, it is clearly the last one received 
in the order, so we can place it in the List last_node and set its 
right pointer to NULL (at least temporarily). 

■ The left pointer of the new node is NULL if it is a leaf. Otherwise 
it is the entry in last_node one level lower than the new node. 

■ So that we can treat the leaves in the same way as other nodes, 
we consider the leaves to be on level 1, and we set up the initial 
element of last_node, in position 0, to have the pointer value 
NULL permanently. 

■ This convention means that we shall always count levels above 
the leaves inclusively, so the leaves themselves are on level 1, 
and so on. 

■ While we build up a tree, we need access to the internal struc- 
ture of the tree in order to create appropriate links. Therefore, 
the new function will be implemented as a (public) method for 
a class of search trees. We will therefore create a new class 
called a BuildableJree that is derived from the class Search_tree 
and possesses a new method, buildJree. The specification for 
a buildable tree is thus: 

template < class Record > 

class BuildableJree: public Search_tree< Record > { 
public: 

Error_code build_tree(const List< Record > &supply); 
private: // Add auxiliary function prototypes here. 

}; 
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Method to Build Binary Search Tree 



template < class Record > 

Error_code Buildable_tree<Record> :: 

build_tree(const List< Record > &supply) 
/" Post: If the entries of supply are in increasing order, a code of success is returned 
and tine Search Jree is buiit out of tiiese entries as a balanced tree. Otiierwise, 
a code of fail is returned and a balanced tree is constructed from the longest 
increasing sequence of entries at the start of supply. 
Uses: The methods of class List; the functions buildJnsert, connect_subtrees, 
anc/ find_root -/ 

{ 

Error_code ordered_data = success; 

// Set this to fai I if keys do not increase. 
int count = 0; // number of entries inserted so far 

Record x, last_x; 

List < Binary _node< Record > * > last_node; 

// pointers to last nodes on each level 
Binary _node< Record > -''none = NULL; 

last_node.insert(0, none); // permanently N\Jll (for children of leaves) 

while (supply. retrieve(count, x) == success) { 
if (count > 0 &&X <= last_x) { 
ordered.data = fail; 
break; 

} 

build_insert(++count, x, last_node); 
last_x = x; 

} 

root = find_root(last_node); 
connect_trees(last_node); 

return ordered_data; // Report any data-ordering problems back to client. 



Method to Build Binary Search Tree 
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Inserting a Node 



template < class Record > 
void Buildable_tree<Record> :: 
buildJnsert(int count, const Record &new_data, 

List < Binary _node< Record >>'- > &last_node) 
/" Post: A new node, containing the Record new_data, has been inserted as the 
rightmost node of a partially completed binary search tree. The level of this new 
node is one more than the highest power of 2 that divides count. 
Uses: Methods of class Us\ -/ 

{ 

1 nt I eve I ; // level of new node above the leaves, counting inclusively 

for (level = 1; count % 2== 0; level++) 

count /= 2; // Use count to calculate level of next_node. 

Binary _node< Record > 

>'vnext_node = new Binary _node< Record >(new_data), 
* parent; // one level higher in last_node 

last_node.retrieve(level — 1, next_node->left); 
if (last_node.size() <= level) 

last_node.insert(level, next_node); 
else 

last_node. replace(level, next_node); 
if (last_node.retrieve(level + 1, parent) == success && 

parent->right == NULL) 

parent->right = next_node; 
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Finishing the Task 



last_node 




n = 21 



Finding the root: 



template < class Record > 

Binary _node< Record > >'vBuildable_tree<Record> ::find_root( 

List < Binary _node< Record >'V > &last_node) 
/" Pre: The list last_node contains pointers to the last node on each occupied 
level of the binary search tree. 
Post: A pointer to the root of the newly created binary search tree is returned. 
Uses: Methods of class L\s\ -/ 

{ 

Binary _node< Record > >vhigh_node; 
last_node.retrieve(last_node.size() — 1, high_node); 

// Find root in the highest occupied level in last_node. 
return high_node; 

} 
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Tying Subtrees Together 



template < class Record > 

void Buildable_tree<Record> ::connect_trees( 

const List < Binary _node< Record >>'^ > &last_node) 
/" Pre: The nearly-completed binary search tree has been initialized. List last_node 
has been initialized and contains links to the last node on each level of the tree. 

Post: The final links have been added to complete the binary search tree. 

Uses: Methods of class L\s\ *l 

{ 

Binary _node< Record > 
-'^iiigii.node, // from last.node with NULL right child 

'V|ow_node; // candidate for right child of liigli_node 

int liigliJevel = last_node.size() — 1, 
lowJevel; 

while (liigliJevel > 2) { // Nodes on levels 1 and 2 are already OK. 
last_node.retrieve(high_level, high_node); 
if (high_node->right != NULL) 

h i g h _l eve I — ; // Search down for highest dangling node. 

else { // Case: undefined right tree 

lowJevel = highJevel; 

do { // Find the highest entry not in the left subtree. 

last_node.retrieve( — lowJevel, low_node); 
} while (low_node != NULL&& 

low_node->data < high_node->data); 
high_node->right = low_node; 
highJevel = lowJevel; 

} 

} 

} 
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Random Search Trees and Optimality 



Problem: If we assume that the keys have been inserted into a 
binary search tree in random order, then, on average, how many 
more comparisons are needed in a search of the resulting tree 
than would be needed in a completely balanced tree? 




There is a one-to-one correspondence between binary search 
trees and 2-trees in which left and right are considered dif- 
ferent from each other. 



■ Assume that the n\ possible orderings of keys are equally 
likely in building the binary search tree. 

■ When there are n nodes in the tree, we denote by S(n) the 
number of comparisons done in the average successful search 
and by U(n) the number in the average unsuccessful search. 

■ Recurrence relation: 

S{n) = 1 H . 

n 
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Performance of Random Binary Search Trees 

Relation between internal and external path length: 

S{n) = + U{n) - 3. 

Solve, subtracting case n — 1 from case n : 

4 

U(n) = U(n - 1) + 



n-\-l 



Evaluate by expanding down to case 0 and using the Harmonic 
Number: 

11 1 . 

Hn = l-{ \ \ h-^ Inn. 

2 3 n 



Theorem 10.3 The average number of nodes visited during 
a search of the average binary search tree with n nodes is 
approximately 

2lnn = (21n2)(lgn) ^ 1.39 Ign, 

and the number of key comparisons is approximately 

4\nn = (41n2)(lgn) ^ 2.77\gn. 



Corollary 10.4 The average binary search tree requires ap- 
proximately 21n2 ~ 1.39 times as many comparisons as a 
completely balanced tree. 
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Height Balance: AVL Trees 



Definition: 

An AVL tree is a binary search tree in which the heights of 
the left and right subtrees of the root differ by at most 1 and 
in which the left and right subtrees are again AVL trees. 

With each node of an AVL tree is associated a balance 
factor that is left higher, equal, or right higher according, 
respectively, as the left subtree has height greater than, equal 
to, or less than that of the right subtree. 



History: 

The name AVL comes from the discoverers of this method, 
G. M. Adel'son-Vel'skii and E. M. Landis. The method dates 
from 1962. 



Convention in diagrams: 

In drawing diagrams, we shall show a left-higher node by V,' a 
node whose balance factor is equal by ' — ,' and a right-higher 
node by '\.' 



Height Balance: AVL Trees 

Transp. 37, Sect. 10.4, Height Balance: AVL Trees 
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C++ Conventions for AVL Trees 



■ We employ an enumerated data type to record balance factors: 
enum BalanceJactor { left_higher, equaLheight, right_higher }; 

■ AVL nodes are structures derived from binary search tree 
nodes with balance factors included: 

template < class Record > 

struct AVL_node: public Binary _node< Record > { 

// additional data member: 

BalanceJactor balance; 
// constructors: 

AVL_node(); 

AVL_node(const Record &x); 
// overridden virtual functions: 
void set_balance(Balance_factor b); 
BalanceJactor get_balance( ) const; 

}; 

■ In a Binary_node, left and right have type Binary _node so the 
inherited pointer members of an AVL_node have this type too, 
not the more restricted AVL_node In the insertion method, 
we must make sure to insert only genuine AVL nodes. 

■ The benefit of implementing AVL nodes with a derived struc- 
ture is the reuse of all of our functions for processing nodes of 
binary trees and search trees. 



C++ Conventions for AVL Trees 
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Methods for Balance Factors 



template < class Record > 

void AVL_node<Record> :: set_balance(Balance_factor b) 
{ 

balance = b; 

} 

template < class Record > 

Balance Jactor AVL_node< Record > : : get_balance( ) const 
{ 

return balance; 

} 



■ We often invoke these methods through pointers to nodes, 
such as left->get_balance(). But left could (for the compiler) 
point to any Binary _node, not just to an AVL.node, and these 
methods are declared only for AVL_node. 

■ A C++ compiler must reject a call such as left->get_balance(), 
since left might point to a Binary _node that is not an AVL_node. 



Methods for Balance Factors 

Transp. 40, Sect. 10.4, Height Balance: AVL Trees 
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Dummy Methods and Virtual Methods 



■ To enable calls such as left->get_balance(), we include dummy 
versions of get_balance() and set_balance( ) in the underlying 
Binary_node structure. These do nothing: 

template < class Entry > 

void Binary_node<Entry> :: set_balance(Balance .factor b) 
{} 

template < class Entry > 

Balance -factor Binary_node<Entry> :: get_balance() const 
{return equaLheight; } 

■ The correct choice between the AVL version and the dummy 
version of the method can only be made at run time, when the 
type of the object -left is known. 

■ We therefore declare the Binary_node versions of set_balance 
and get_balance as virtual methods, selected at run time: 

template < class Entry > 
struct Binary _node { 
// data members: 
Entry data; 

Binary _node< Entry > '"-left; 

Binary _node< Entry > >'^right; 
// constructors: 

Binary _node(); 

Binary _node(const Entry &x); 
// virtual methods: 

virtual void set_balance(Balance_factor b); 

virtual Balance Jactor get_balance( ) const; 

}; 



Dummy Methods and Virtual Methods 

Transp. 41, Sect. 10.4, Height Balance: AVL Trees 
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Class Declaration for AVL Trees 



■ We must override the earlier insertion and deletion func- 
tions for binary search trees with versions that maintain the 
balanced structure of AVL trees. 

■ All other binary search tree methods can be inherited without 
any changes. 

template < class Record > 

class AVL_tree: public Search_tree< Record > { 

public: 

Error_code insert(const Record &new_data); 
Error_code remove(const Record &old_data); 

private: // Add auxiliary function prototypes here. 

}; 

■ The inherited data member of this class is the pointer root, 
which has type Binary _node< Record > * and thus can store the 
address of either an ordinary binary tree node or an AVL tree 
node. 

■ We must ensure that the overridden insert method only creates 
nodes of type AVL_node; doing so will guarantee that all nodes 
reached via the root pointer of an AVL tree are AVL nodes. 



C++ Conventions for AVL Trees 
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Insertions into an AVL tree 




k, m: 





Insertions into an AVL tree 

Transp. 43, Sect. 10.4, Height Balance: AVL Trees 
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Public Insertion IVIethod 



template < class Record > 

Error_code AVL_tree<Record> :: insert(const Record &new_data) 
/" Post: If the /cey ofnew_data is already in the AVLAree, a code ofduplicate_error 
is returned. Otherwise, a code of success is returned and the Record 
new_data is inserted into the tree in such a way that the properties of an AVL 
tree are preserved. 
Uses: avLinsert. -/ 

{ 

bool taller; // Has the tree grown in height? 

return avLinsert(root, new_data, taller); 

} 



Recursive Function Specifications 

template < class Record > 
Error_code AVL_tree<Record> :: 

avlJnsert(Binary_node< Record > * &sub_root, 
const Record &new_data, bool &taller) 
/" Pre: sub.root is either N\Jll or points to a subtree of the AVL.tree 

Post: If the key of new.data is already in the subtree, a code of duplicate.error 
is returned. Otherwise, a code of success is returned and the Record 
new_data is inserted into the subtree in such a way that the properties of 
an AVL tree have been preserved. If the subtree is increased in height, the 
parameter taller is set to true; otherwise it is set to false. 
Uses: Methods obstruct AVL_node; functions av\jnser\ recursively 
left_balance, anc/ right_balance. */ 



Recursive Function Specifications 

Transp. 44, Sect. 10.4, Height Balance: AVL Trees 
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Recursive Insertion 



Error.code result = success; 
if (sub_root == NULL) { 

sub_root = new AVL_node< Record >(new_data); 

taller = true; 

} 

else if (new_data == sub_root->data) { 
result = duplicate_error; 
taller = false; 

} 

else if (new_data < sub_root->data) { // Insert in left subtree. 
result = avLinsert(sub_root->left, new_data, taller); 
if (taller == true) 
switch (sub_root->get_balance()) { // Change balance factors. 
case left_higher: 
left_balance(sub_root); 

taller = false; // Rebalancing always shortens the tree. 

break; 

case equaLheight: 
sub_root->set_balance(left_higher); 
break; 

case right_higher: 
sub_root->set_balance(equaLheight); 
taller = false; 
break; 

} 

} 



Recursive Insertion 
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Recursive Insertion, Continued 



else { // Insert in right subtree. 

result = avljnsert(sub_root->right, new_data, taller); 
if (taller == true) 
switch (sub_root->get_balance()) { 
case left_higher: 
sub_root->set_balance(equaLheight); 
taller = false; 
break; 

case equaLheight: 
sub_root->set_balance(right_higher); 
brealc; 

case right_higher: 
right_balance(sub_root); 

taller = false; // Rebalancing always shortens the tree. 

brealc; 

} 

} 

return result; 



Recursive Insertion, Continued 

Transp. 46, Sect. 10.4, Height Balance: AVL Trees 
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Rotations of an AVL Tree 




Total height = h + 3 T^^al height =h + 2 

template < class Record > 

void AVL_tree<Record> :: rotate Jeft(Binary_node< Record > &sub_root) 
/" Pre: sub_root points to a subtree of the AVL_tree. This subtree has a nonempty 
right subtree. 

Post: sub_root is reset to point to its former right child, and the former sub_root 
node is the left child of the new sub.root node. *l 

{ 

if (sub_root == NULL || sub_root->right == NULL) // impossible cases 
cout « "WARNING: program error detected in rotateJeft" « end!; 
else { 

Binary _node< Record > >'vriglit_tree = sub_root->riglit; 
sub_root->riglit = riglit_tree->left; 
riglit_tree->left = sub_root; 
sub_root = riglitJree; 

} 

} 



Rotations of an AVL Tree 
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Double Rotation 



- 1 sub tree 



h 



. , becomes 
I right_tree ^ 



sub tree 



h-1 
or 

h 



h 




right_tree 



h 



h-1 
or 

h 



h 



Total height =h + 2 



One of T2 or has height h. 
Total height = h + 3 



The new balance factors for root and rightJree depend on the pre- 
vious balance factor for sub_tree: 



old subJree new root new rightJree new subJree 

/ - \ 

\ / - 



Double Rotation 
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template < class Record > 
void AVL_tree< Record > : : 

right_balance(Binary_node< Record > * &sub_root) 
/* Pre: sub.root points to a subtree of an AVLJree, doubly unbalanced on the right. 

Post: The AVL properties have been restored to the subtree. 

Uses: Methods of struct AVL_node; functions rotate_right, rotate_left. *l 

{ 

Binary _node< Record > * &right_tree = sub_root->right; 
switch (right_tree->get_balance()) { 
case right_higher: // single rotation left 

sub_root->set_balance(equaLheight); 
right_tree->set_balance(equaLheight); 
rotate J eft (sub_root); break; 
case equaLheight: // impossible case 

cout « "WARNING: program error in right_balance" « endl; 
case left_higher: // double rotation left 

Binary _node< Record > *sub_tree = right_tree->left; 
switch (sub_tree->get_balance( )) { 
case equaLheight: 
sub_root->set_balance(equal_height); 
right_tree->set_balance(equaLheight); break; 
case left_higher: 
sub_root->set_balance(equaLheight); 
right_tree->set_balance(right_higher); break; 
case right_higher: 
sub_root->set_balance(left_higher); 
right_tree->set_balance(equaLheight); break; 

} 

sub_tree->set_balance(equaLheight); 
rotate_right(right_tree); 
rotate_left(sub_root); break; 

} 

} 



Fianction to restore right balance 

Transp. 49, Sect. 10.4, Height Balance: AVL Trees 



Data Structures and Program Design In C++ 
369 © 1999 Prentice-Hall, Inc., Upper Saddle River, N.J. 07458 



Removal of a Node 



1. Reduce the problem to the case when the node x to be removed 
has at most one child. 

2. Delete x. We use a bool variable shorter to show if the height 
of a subtree has been shortened. 

3. While shorter is true do the following steps for each node p on 
the path from the parent of x to the root of the tree. When 
shorter becomes false, the algorithm terminates. 

4. Case 1: Node p has balance factor equal. The balance factor 
of p is changed according as its left or right subtree has been 
shortened, and shorter becomes false. 

5. Case 2: The balance factor of p is not equal, and the taller 
subtree was shortened. Change the balance factor of p to 
equal, and leave shorter as true. 

6. Case 3: The balance factor of p is not equal, and the shorter 
subtree was shortened. Apply a rotation as follows to restore 
balance. Let q be the root of the taller subtree of p. 

7. Case 3a: The balance factor of q is equal. A single rotation 
restores balance, and shorter becomes false. 

8. Case 3b: The balance factor of q is the same as that of p. 
Apply a single rotation, set the balance factors of p and q to 
equal, and leave shorter as true. 

9. Case 3c: The balance factors of p and q are opposite. Ap- 
ply a double rotation (first around q, then around p), set the 
balance factor of the new root to equal and the other balance 
factors as appropriate, and leave shorter as true. 



Removal of a Node 
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Example of deletion from an AVL tree 

Transp. 52, Sect. 10.4, Height Balance: AVL Trees 
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Analysis of AVL Trees 



■ The number of recursive calls to insert a new node can be as 
large as the height of the tree. 

■ At most one (single or double) rotation will be done per inser- 
tion. 

■ A rotation improves the balance of the tree, so later insertions 
are less likely to require rotations. 

■ It is very difficult to find the height of the average AVL tree, 
but the worst case is much easier. The worst-case behavior of 
AVL trees is essentially no worse than the behavior of random 
search trees. 

■ Empirical evidence suggests that the average behavior of AVL 
trees is much better than that of random trees, almost as good 
as that which could be obtained from a perfectly balanced tree. 



Analysis of AVL Trees 
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Data Structures and Program Design In C++ 
373 © 1999 Prentice-Hall, Inc., Upper Saddle River, N.J. 07458 



Worst-Case AVL Trees 



■ To find the maximum height of an AVL tree with n nodes, we 
instead find the minimum number of nodes that an AVL tree 
of height h can have. 

■ Let Fh be such a tree, with left and right subtrees F/ and 
Fr . Then one of F/ and Fr , say Fi , has height h — 1 and the 
minimum number of nodes in such a tree, and F^ has height 
h — 2 with the minimum number of nodes. 

■ These trees, as sparse as possible for AVL trees, are called 
Fibonacci trees. 




Worst-Case AVL Trees 
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Analysis of Fibonacci Trees 



■ If we write | T \ for the number of nodes in a tree T , we then 
have the recurrence relation for Fibonacci trees: 



\Fh\ = \Fh-i \ + \Fh-2 \ + 1, 
where |Fo| = 1 and = 2. 

By the evaluation of Fibonacci numbers in Section A. 4, 

■ -[h+2 

1 l + x/5 



Take the logarithms of both sides, keeping only the largest 
terms: 

h ^ lAA\g\Fh\. 

The sparsest possible AVL tree with n nodes has height about 
1.44 Ig^ compared to: 

■ A perfectly balanced binary search tree with n nodes has 
height about Ig n . 

■ A random binary search tree, on average, has height about 
1.39 Ign. 

■ A degenerate binary search tree has height as large as n . 



■ Hence the algorithms for manipulating AVL trees are guar- 
anteed to take no more than about 44 percent more time than 
the optimum. In practice, AVL trees do much better than this 
on average, perhaps as small as Ig^ + 0.25. 



Worst-Case AVL Trees 
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Splay Trees: A Self-Adjusting Data Structure 



■ In some applications, we wish to keep records that are newly 
inserted or frequently accessed very close to the root, while 
records that are inactive may be placed far off, near or in the 
leaves. 

■ We make a binary search tree into a self-adjusting data 
structure that automatically changes its shape to bring rec- 
ords closer to the root as they are more frequently accessed, 
allowing inactive records to drift slowly out toward the leaves. 

■ In a splay tree, every time we access a node, whether for 
insertion or retrieval, we lift the newly-accessed node all the 
way up to become the root of the modified tree. 

■ We use rotations like those of AVL trees, but now with many 
rotations done for every insertion or retrieval in the tree. 

■ We move the target node two levels up the tree at each step. 
Consider the path going from the root down to the accessed 
node. If we move left, we say that we zig, and if we move right 
we say that we zag. A move of two steps left (going down) is 
then called zig-zig, two steps right zag-zag, left then right 
zig-zag, and right then left zag-zig. If the length of the path 
is odd, either a single zig move or a zag move occurs at the 
end. 



Splay Trees: A Self-Adjusting Data Structure 
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Splay Rotations 




Splay Rotations 
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Notes on Splay Rotations 



■ The zig-zag case is identical to that of an AVL double rotation, 
and the zig case is identical to a single rotation. 

■ The zig-zig case is not the same as would be obtained by lifting 
the target node twice with single rotations. 

■ Always think of lifting the target two levels at a time (except 
for a single zig or zag step at the end). 

■ It is only the nodes on the path from the target to the root 
whose relative positions are changed, and that only in the 
ways shown by colored dashed curves in the figures. 

■ None of the subtrees off the path (Ti, T2, T^, and changes 
its shape at all, and these subtrees are attached to the path 
in the only places they can go to maintain the search-tree 
ordering of all the keys. 



Notes on Splay Rotations 
Transp. 58, Sect. 10.5, Splay Trees 
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Zig-Zig and Single Rotations 



A zig-zig movement is not the same as would be obtained by lifting 
the target node twice with single rotations. 




Zig-Zig and Single Rotations 
Transp. 59, Sect. 10.5, Splay Trees 
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Top-Down and Bottom-Up Splaying 



■ By hand, we perform bottom-up splaying, beginning at the 
target node and moving up the path to the root two steps at 
a time. A single zig or zag move may occur at the top of the 
tree. Bottom-up splaying is essentially a two-pass method, 
first searching down the tree and then splaying the target up 
to the root. 

■ In a computer algorithm, we splay from the top down while we 
are searching for the target node. When we find the target, it 
is immediately moved to the root of the tree, or, if the search 
is unsuccessful, a new root is created that holds the target. 
In top-down splaying, a single zig or zag move occurs at the 
bottom of the splaying process. 

■ If you run the splaying function we develop on the example 
trees, you will obtain the same results as doing bottom-up 
splaying by hand when the target is moved an even number 
of levels, but the results will be different when it moves an 
odd number of levels. 



Top-Down and Bottom-Up Splaying 
Transp. 61, Sect. 10.5, Splay Trees 
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Algorithm Development 



■ Given a target key, we develop a function that searches down 
the tree for the key, splaying as it goes. If it finds the key, 
then it retrieves it; if not, then the function inserts it in a new 
node. In either case, the node with the target key becomes 
the root of the tree. 

■ We implement splay trees as a derived class of class SearchJree. 



template < class Record > 

class Splay Jree: public Search_tree< Record > { 

public: 

Error_code splay(const Record &target); 
private: // Add auxiliary function prototypes here. 

}; 



■ A splay tree does not keep track of heights and does not use 
any balance factors like an AVL tree. 

■ Top-down splaying uses the same moves (zig-zig, zig-zag, and 
the rest) as bottom-up splaying, but expressed differently. 

■ During splaying, the root is left empty so that, at the end, the 
target node can be moved or inserted directly into the root. 



Algorithm Development 

Transp. 62, Sect. 10.5, Splay Trees 
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The Three-Way Tree Partition 



■ During splaying, the tree temporarily falls apart into three 
separate subtrees, which are reconnected after the target is 
made the root. 

■ The central subtree contains nodes within which the tar- 
get will lie if it is present. 

■ The smaller-key subtree contains nodes whose keys are 
strictly less than the target. Every key in the smaller-key 
subtree is less than every key in the central subtree. 

■ The larger-key subtree contains nodes whose keys are 
strictly greater than the target. Every key in the larger- 
key subtree is greater than every key in the central sub- 
tree. 

■ These conditions hold throughout the splaying process; we 
call them the three-way invariant. 




Keys less If present, target Keys greater 

than target. is in central subtree. than target. 

■ Initially, the central subtree is the whole tree, and the smaller- 
key and larger-key subtrees are empty. 

■ As splaying proceeds, nodes are stripped off the central sub- 
tree and joined to one of the other two subtrees. 

■ When the search ends, the root of the central subtree will be 
the target node if it is present, and the central subtree will be 
empty if the target was not found. 

■ All the components will finally be joined together with the 
target as the root. 



Algorithm Development 

Transp. 63, Sect. 10.5, Splay Trees 
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Basic Action: linl(_right 



■ Suppose the target is smaller than the key in the root of the 
central subtree. 

■ In this case, we take the root and its right subtree and adjoin 
them to the larger-key tree, reducing the central subtree to 
the former left subtree of the root. 

■ We call this action link_right. 

■ link_right is exactly a zig move, except that the link from the 
former left child down to the former parent is deleted; instead, 
the parent (with its right subtree) moves into the larger-key 
subtree. 

■ The three-way invariant tells us that every key in the cen- 
tral subtree comes before every key in the larger-key subtree; 
hence this parent (with its right subtree) must be attached on 
the left of the leftmost node (first in ordering of keys) in the 
larger-key subtree. 

■ Note that, after link_right is performed, the three-way invariant 
continues to be true. 



Basic Action: link_right 

Transp. 64, Sect. 10.5, Splay Trees 
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Action of link_right: a Zig Move 




After: 




smaller-key subtree new central subtree new larger-key subtree 



Action of link_right: a Zig Move 
Transp. 65, Sect. 10.5, Splay Trees 
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Case: target is less than left child of root. 




smaller-key subtree ~ central subtree larger-key subtree 




Zig-zig performed as rotate.right; link.right 
Transp. 66, Sect. 10.5, Splay Trees 
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Case: target is between root and its left child. 




smaller-key subtree central subtree new larger-key subtree 



After linkjeft: 



small 




middle 



large 



new smaller-key subtree 



new central subtree 



new larger-key subtree 



Zig-zag performed as link.right; linkjeft 
Transp. 67, Sect. 10.5, Splay Trees 
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Programming Conventions 



■ We use five pointer variables to keep track of required posi- 
tions in the three subtrees: 

■ current gives the root of the central subtree of nodes not 
yet searched. 

■ child refers to either the left or right child of current, as 
required. 

■ last.small gives the largest (that is, the rightmost) node in 
the smaller-key subtree, which is the place where addi- 
tional nodes must be attached. 

■ first_large gives the smallest (that is, the leftmost) node 
in the larger-key subtree, which is the place where addi- 
tional nodes must be attached. 

■ dummy points to a dummy node that is created at the be- 
ginning of the splaying function and is deleted at its end. 

■ The dummy node contains no key or other data. 

■ We initialize both last.small and firstJarge to dummy. Hence 
these pointers always refer to actual nodes, and therefore 
link_right and linkJeft will not attempt to dereference NULL point- 
ers. 



Programming Conventions 
Transp. 68, Sect. 10.5, Splay Trees 
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C++ Functions 



template < class Record > 
void Splay _tree< Record > : : 

link_right(Binary_node< Record > &current, 

Binary _node< Record > * &firstJarge) 
/" Pre: The po/nferf irst_large points to an actual Binary _node (in particular, it is not 
NULLj. The three-way invariant holds. 
Post: The node referenced by cur rent (with its right subtree) is linked to the left of the 
node referenced by firstJarge. The pointer first_large is reset to current. 
The three-way invariant continues to hold. *l 

{ 

firstJarge ->left = current; 
firstJarge = current; 
current = current->left; 

} 

The rotation functions do not use the dummy node, and they do 
not cause any change in the three-way partition. 

template < class Record > 
void Splay Jree< Record > : : 

rotate_right(Binary_node< Record > * &current) 
/" Pre: current points to the root node of a subtree of a Binary Jree. This subtree 
has a nonempty left subtree. 
Post: current is reset to point to its former left child, and the /brmer current node is 
the right child of the new cu rrent node. *l 

{ 

Binary _node< Record > -'-left Jree = current->left; 
current->left = left_tree->right; 
leftJree->right = current; 
current = leftJree; 

} 



C++ Functions 

Transp. 69, Sect. 10.5, Splay Trees 



Data Structures and Program Design In C++ 
389 © 1999 Prentice-Hall, Inc., Upper Saddle River, N.J. 07458 



Finishing the Task 



■ When the search finishes, the root of the central subtree points 
at the target node or is NULL. 

■ If the target is found, it must become the root of the whole tree, 
but, before that, its left and right subtrees are now known 
to belong in the smaller-key and larger-key subtrees, respec- 
tively, so they should be moved there. 

■ If the search instead terminates unsuccessfully, with current == 
NULL, then a new root containing target must be created. 

■ Finally, the left and right subtrees of the new root should now 
be the smaller-key and larger-key subtrees. 

■ How do we find the roots of these subtrees, since we have 
kept pointers only to their rightmost and leftmost nodes, re- 
spectively? 

■ To answer this question, let us remember what happened at 
the beginning of the search. Initially, both pointers last.small 
and firstJarge were set to refer to the dummy node. When a node 
(and subtree) are attached to the larger-key subtree, they are 
attached on its left, by changing firstJarge->left. Since firstJarge 
is initially dummy, we can now, at the end of the search, find 
the first node inserted into the larger-key subtree, and thus its 
root, simply as dummy->left. Similarly, dummy-> right points to 
the root of the smaller-key subtree. Hence the dummy node 
provides us with pointers to the roots of the smaller- and 
larger-key subtrees that would otherwise be lost. 

■ Note that the pointers are stored in positions reversed from 
what one might expect. 



Finishing the Task 

Transp. 70, Sect. 10.5, Splay Trees 
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After splaying: 



dummy 





current 







target 











last small 



smaller-key subtree 





central subtree 



first_large 



larger-key subtree 



Reconnect into one tree: 



dummy ^ T^ > ? 




root 



current > 



target 







smaller-key subtree 




larger-key subtree 



Reconnecting the subtrees 
Transp. 71, Sect. 10.5, Splay Trees 
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Splaying: The Final IVIethod 



template < class Record > 

Error_code Splay_tree<Record>::splay(const Record &target) 
/" Post: If a node of the splay tree has a key matching that of target, it has been moved 
by splay operations to be the root of the tree, and a code of entry Jound is 
returned. Otherwise, a new node containing a copy of target is inserted as the 
root of the tree, and a code of entry .inserted is returned. *l 

{ 

Binary _node< Record > >'vdunnnny = new Binary _node< Record >; 
Binary _node< Record > >'vcurrent = root, 

''Hast.smai! = dummy, 
'VfirstJarge = dummy; 

// Search for target while splaying the tree. 
while (current != NULL&&current->data ! = target) 
if (target < current->data) { 
child = current->left; 

if (child == NULL II target == child->data) // zigmove 

link_right(current, firstJarge); 
else if (target < child->data) { // zig-zigmove 

rotate_right(current); 

link_right(current, firstJarge); 

} 

else { // zig-zag move 

link_right(current, firstJarge); 
linkJeft(current, last_small); 

} 

} 



Splaying: The Final Method 
Transp. 72, Sect. 10.5, Splay Trees 
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Splaying Method, Continued 



else { // case: target > current->data 

child = current->right; 
if (child == NULL || target == child->data) 

linkJeft(current, last.small); // zagmove 
else if (target > child->data) { // zag-zagmove 

rotate_left(current); 

linkJeft(current, last.small); 

} 

else { // zag-zig move 

linkJeft(current, last.small); 
link_right(current, firstJarge); 

} 

} 

// Move root to the current node, which is created if necessary. 
Error_code result; 

if (current == NULL) { // Search unsuccessful: mai<e a new root. 

current = new Binary _node< Record >(target); 
result = entry .inserted; 
last_snnall->right = firstJarge->left = NULL; 

} 

else { // successful search 

result = entry Jound; 

last_snnall->right = current->left; // Move remaining central nodes. 
firstJarge->left = current->right; 

} 

root = current; // Define the new root. 

root -> r i g ht = d u m my -> I ef t ; // root of larger-key subtree 
root -> I ef t = d u m my -> r i g ht ; // root of smaller-key subtree 
delete dummy; 
return result; 



Splaying Method, Continued 
Transp. 73, Sect. 10.5, Splay Trees 
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Amortized Algorithm Analysis: Introduction 



■ In both worst-case analysis and average-case analysis, we take 
a single event or single situation and attempted to determine 
how much work an algorithm does to process it. 

■ Amortized analysis differs in that it considers a long sequence 
of events rather than a single event in isolation. Amortized 
analysis then gives a worst-case estimate of the cost of a long 
sequence of events. 

■ One event in a sequence may affect the cost of later events. 
One task may be difficult, but it may leave a data structure 
in a state where the tasks that follow become much easier. 

■ In finance, amortization means to spread a large expense over 
a period of time. Accountants amortize a large capital expen- 
diture over it lifetime. Insurance actuaries amortize high-risk 
cases over the general population. 

■ Amortized analysis is not the same as average-case analysis, 
since the former considers a sequence of related situations and 
the latter all possible independent situations. 



Amortized Algorithm Analysis: Introduction 
Transp. 74, Sect. 10.5, Splay Trees 
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Tree Traversal 



Consider the inorder traversal of a binary tree, where we measure 
the cost of visiting one vertex as the number of branches traversed 
to reach that vertex from the last one visited. 




■ The best-case cost of visiting a vertex is 1, when we go from a 
vertex to one of its children or to its parent. 

■ The worst-case cost, for a tree with n vertices, is /i — 1, as 
shown by the tree that is one long chain to the left, where it 
takes n — 1 branches to reach the first (leftmost) vertex. 

■ The amortized cost of going from each vertex to the next over 
the traversal of any binary tree is less than 2. 



Tree Traversal 

Transp. 75, Sect. 10.5, Splay Trees 
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Credit Balance: Making Costs Level 



■ We invent a function, which we call a credit balance, chosen 
so that it will be large when the next operation is expensive 
and smaller when the next operation can be done quickly. 

■ Think of the credit balance as helping to bear some of the cost 
of expensive operations. For inexpensive operations, we set 
aside more than the actual cost, using the excess to build up 
the credit balance for future use. 



Definition The amortized cost ai of each operation in a 
sequence of m operations is defined to be Ui = ti + c/ — 
for / = 1, 2, . . . , m, where U is the actual cost of operation 
/ , Co is the credit balance before the first operation, and Ci is 
the credit balance after operation / , for 1 < / <m. 



Goal: Choose the credit-balance function so as to make 
the amortized costs ai as nearly equal as possible, no matter 
how the actual costs ti may vary. 



Lemma 10.5 The total actual cost and total amortized cost 
of a sequence of m operations on a data structure are related 
by 

m / ^ \ 

^ = I y^<2/j +Co - Cm. 

i=l \i=l I 



Credit Balance: Making Costs Level 
Transp. 76, Sect. 10.5, Splay Trees 
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Incrementing Binary Integers 



Consider an algorithm that continually increments a binary (base 
2) integer by 1. The cost of each step is the number of bits (binary 
digits) that are changed from one number to the next. 



step i 
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= actual cost = number of digits changed 
Ci = credit-balance function = number of I's in integer 
ai = amortized cost = ti + Cf — 



Incrementing Binary Integers 
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Amortized Analysis of Splaying 



Let r be a binary search tree, J) be T as it is after step / of 
splaying, Ti{x) be the subtree with root x in 7}, |7^(x)| be the 
number of nodes in Ti{x), and define the rank of x to be (x) = 



The Credit Invariant 

For every node x of r and after every step / of splaying, 
node X has credit equal to its rank {x) . 



The total credit balance for the tree is defined as the sum of the 
individual credits for all the nodes in the tree. 



If the tree is empty or contains only one node, then its credit bal- 
ance is 0. As the tree grows, its credit balance increases, and this 
balance should reflect the work needed to build the tree. 



Lemma 10.6 If a, ^, and y are positive real numbers with 
a -\- P <y, then \ga -\-\g^ < 2\gy - 2. 



lg\Ti(x)\. 




xeTi 



Amortized Analysis of Splaying 
Transp. 78, Sect. 10.5, Splay Trees 
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Amortized Complexity of Splay Steps 



Lemma 10.7 If the / splaying step is a zig-zig or zag-zag 
step at node x , then its amortized complexity at satisfies the 
inequality ai < 3(ri(x) — rj_i(x)). 




Lemma 10.8 If the / splaying step is a zig-zag or zag-zig 
step at node x , then its amortized complexity satisfies 

ai < 2(r,(x) - r,-i(x)). 



Lemma 10.9 If the / splaying step is a zig or a zag step at 
node X , then its amortized complexity a/ satisfies 

ai <1 + [ri{x) - r/_i(x)). 



Amortized Complexity of Splay Steps 
Transp. 79, Sect. 10.5, Splay Trees 
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Amortized Complexity of Splaying 



■ To find the total amortized cost of a retrieval or insertion, 
we must add the costs of all the splay steps done during the 
retrieval or insertion. 



Theorem 10.10 The amortized cost of an insertion or re- 
trieval with splaying in a binary search tree with n nodes 
does not exceed 

l + 31g^ 

upward moves of the target node in the tree. 



■ Finally, we can relate this amortized cost to the actual cost of 
each of a long sequence of splay insertions or retrievals. 



Corollary 10.11 The total complexity of a sequence of m in- 
sertions or retrievals with splaying in a binary search tree 
that never has more than n nodes does not exceed 

m(l -\- 3\g n) -\- n\g n 

upward moves of a target node in the tree. 



■ In this result, each splaying step counts as two upward moves, 
except for zig or zag steps, which count as one move each. 

■ The fact that insertions and retrievals in a splay tree, over a 
long sequence, are guaranteed to take only O (log n) time is 
quite remarkable, given that, at any time, it is quite possible 
for a splay tree to degenerate into a highly unbalanced shape. 



Amortized Complexity of Splaying 
Transp. 80, Sect. 10.5, Splay Trees 
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Pointers and Pitfalls 



1. Consider binary search trees as an alternative to ordered lists. 
At the cost of an extra pointer field in each node, binary search 
trees allow random access (with O(log^) key comparisons) to 
all nodes while maintaining the flexibility of linked lists for 
insertions, deletions, and rearrangement. 

2. Consider binary search trees as an alternative to tables. At 
the cost of access time that is 0(\ogn) instead of 0(1), bi- 
nary search trees allow traversal of the data structure in the 
order specified by the keys while maintaining the advantage 
of random access provided by tables. 

3. In choosing your data structures, always carefully consider 
what operations will be required. Binary trees are especially 
appropriate when the requirements include random access, 
traversal in a predetermined order, and flexibility in making 
insertions and deletions. 

4. While choosing data structures and algorithms, remain alert 
to the possibility of highly unbalanced binary search trees. 
AVL trees provide nearly perfect balance at a slight cost in 
computer time and space, but with considerable programming 
cost. If it is necessary for the tree to adapt dynamically to 
changes in the frequency of the data, then a splay tree may 
be the best choice. 

5. Binary trees are defined recursively; algorithms for manipu- 
lating binary trees are usually, but not always, best written 
recursively. 

6. Although binary trees are most commonly implemented as 
linked structures, remain aware of the possibility of other im- 
plementations. 



Pointers and Pitfalls 

Transp. 81, Chapter 10, Binary Trees 
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